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Starting with the same form of atomic nonUnearity Weinberg [Q] and Wodkiewicz and Scully Q 
obtained contradictory results concerning an evolution of the atomic inversion to in a two-level atom 
in Weinberg's nonlinear quantum mechanics: If the atom is initially in a ground state then either 
(1) the evolution of w can be linear if one uses a nonlinear generalization of the Jaynes-Cummings 
Hamiltonian, or (2) is always nonlinear if one uses the nonlinear Bloch equations derived from the 
I nonlinear atomic Hamiltonian function. It is shown that the difference is rooted in inequivalent 

descriptions of the composite "atom+field" system. The linear evolution of w results from a "faster- 
than-light communication" between the atom and the field. If one applies a description without the 
"faster-than-light telegraph" then the calculations based on a suitably modified Jaynes-Cummings 
C ■ Hamiltonian lead to the same dynamics of w as this found in semiclassical calculations based on Bloch 

^ ' equations. A physical background of the two approaches is discussed in detail, and two, inequivalent 

kinds of such "telegraphs" are described. It is shown also that a nonlinear quantum mechanics based 
on a nonlinear Schrodinger equation does not possess a natural probability interpretation. Various 
definitions of nonlinear eigenvalues and probabilities are discussed and illustrated with examples. 
" . Finally, I discuss problems that appear if we apply the above results to the "object +observer" 

' composite system, where the "observer" can evolve in a nonlinear way. 

o 
o 

^ 

o 
in 

Qs^ • I. INTRODUCTION 

^ ^ The purpose of the generalization of quantum mechanics proposed by S. Weinberg Q,^,^ "was not seriously to 
^ propose an alternative to quantum mechanics but only to have some theory whose predictions would be close to 
■ but not quite the same as those of quantum mechanics, to serve as a foil that might be tested experimentally" 
Next, in a series of very precise experiments the Weinberg's theory was indeed tested [||, showing no observable 
^ deviations from linearity. This was neither particularly surprising, as Weinberg admits in his recent popular book, 
nor the most discouraging feature of the new framework: "Even if there are small nonlinear corrections to quantum 
mechanics, there was no reason to believe that these corrections should be just large enough to show up in the first 
round of experiments designed to search for them. What I did find disappointing was that this nonlinear alternative 
to quantum mechanics turned out to have purely theoretical internal difficulties. For one thing, I could not find any 
' way to extend the nonlinear version of quantum mechanics to theories based on Einstein's special theory of relativity. 
Then, after my work was published, both N. Gisin in Geneva and my colleague Joseph Polchinski at the University 
of Texas independently pointed out that in the Einstein-Podolsky-Rosen thought experiment (...) the nonlinearities 
of the generalized theory could be used to send signals instantaneously over large distances, a result forbidden by 
special relativity. At least for the present I have given up the problem; I simply do not know how to change quantum 
mechanics by a small amount without wrecking it altogether" ||^. 

This quotation explains almost all purposes of this paper. In its first part I will try to point out the main internal 
theoretical dijficulties of those versions of nonlinear quantum mechanics (NLQM) that are based on a nonlinear 
Schrodinger equation. We shall see, in particular, that the problem of a correct description of composite systems 
in NLQM is not only related to the celebrated "faster-than-light telegraphs" mentioned in the above quotation, but 
may also lead to different predictions for two-level atoms, a basic theoretical tool used in calculations for actual 
experiments. 

Just to illustrate the latter let us notice the following apparent inconsistency. It has been shown in 0| that a 
system consisting initially of a single photon electromagnetic field and a two-level atom in a ground state, performs 
ordinary linear Rabi oscillations even in case the atomic Schrodinger equation contains a nonlinear term. On the 
other hand, the calcultions of Wodkiewicz and Scully Q starting with the same form of nonlinearity but based 
on the Bloch equations approach lead, with the same initial condition for the atomic inversion, to elliptic hence 
nonlinear oscillations of the inversion. It will be shown below that the difference is rooted simply in non-equivalent 
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descriptions of the composite "atom+field" system. The hnear evolution occurs if we use the description with an 
impUcit "faster-than-Ught telegraph", whereas the nonlinear one is found if no such "telegraphs" are present. 

The two descriptions are based on different physical assumptions which shall be discussed in detail below. To make 
matters worse, it will be shown that there does not exist a simple alternative between the two ways of describing 
compound systems. In fact, we will see that there exists an infinite number of inequivalent and still "correct" 
possibilities of describing systems such as the the atom and the field, if one assumes that the atom alone evolves 
according to some nonlinear Schrodinger equation. This arbitrariness not only leads to inequivalent descriptions of 
optical phenomena but also to fundamental difhculties in interpretation of experiments. 

Another non-unique element which additionally obscures the meaning of experiments is related to the probabilistic 
interepretation of NLQM based on a nonlinear Schrodinger equation. I will show that there does not exist a unique 
definition of eigenvalues or probabilities (propositions), and those used for interpreting experiments testing Weinberg's 
theory were not applied in an internally consistent way. 

In the second part of the paper 0| I will propose a new framework for NLQM. The proposed generalization is based 
on a kind of Nambu mechanics where nonlinearities can be introduced without any modification of observables. This 
formulation of NLQM will be free from the difhcultics of the previous approaches, no "faster-than-light telegraphs" 
will occur, the description of composite systems and the probalistic interpretation will be unique, and, last but not 
least, such a theory can be naturally formulated in a relativistic way. The results of both parts are based on the 
dissertation 

The following sections are devoted to a critical presentation of the basic theoretical concepts of NLQM. 



II. NONLINEAR QUANTUM MECHANICS A LA KIBBLE, WEINBERG, POLCHINSKI, AND JORDAN 

Adding a nonlinear term to the Schrodinger equation does not mean that we have a generalization of quantum 
mechanics. An evolution equation is only a part of a theory. A complete theory must give a concrete prescription for 
a translation of theoretical formulas into the language of experiment. In the context of quantum theories we must 
know first of all what are the mathematical cunterparts of random variables, probabilities, and averages measured 
in a laboratory. In linear quantum mechanics (QM) they are represented by eigenvalues and matrix elements of 
projectors or self-adjoint operators. In NLQM based on nonlinear Schrodinger equation none of those objects appears 
in a natural way. 



A. States 



We will discuss theories where states are represented in the same way as in ordinary quantum mechanics (the 
so-called normal states): by vectors or rays in a Hilbert space, or by density matrices. We should also keep in mind 
that possible exte nsions of QM may involve different manifolds of states such as general Kahler manifolds 



| [ll| , |l2 13 l4|l5| , M . A necessity of some extension of that kind is suggested, for example, by Penrose |17|. We will not 
include stochastic generahzations either, cf. 111]. 



B. Observables 

In Hamiltonian formulation of the Schrdinger equation (real and imaginary parts of) components of a state vector 
play a role of symplectic coordinates, and observables are bilinear functionals {tp\A\^), where A is a self-adjoint 
operator. 

The fact that quantum observables are so specifically chosen leads to a special, complex Hilbertian type of the 
quantum mechanical probablity calculus. It seems it was Mielnik who noticed [ p"9|j2C| , |2l] , p^ that since there exist many 
different and inequivalent probability models they may be related to some nonlinear version of quantum maechanics. 
Such a perspective seems quite natural and suggests that one should try to investigate theories where obsevables, like 
in classical mechanics, belong to a more general set of functionals. 

In the generalization proposed by Kibble PpX]] the set of observables consists of differentiable functions defined on 
a projective space. In the Weinberg formulation ||l| one assumes that the observables are defined on a Hilbert space 
but satisfy the (l,l)-homogeneity condition 

A{Xi;,i;)=A{i;,X^p)^XA{i;,i;). (1) 
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Averages are defined in |^ like in ordinary quantum mechanics. The (1,1) -homogeneity and the definition of averages 
introduces the projective structure into the set of states and, effectively, into the dynamics. This is the way the 
Weinberg approach can be regarded as a variant of the Kibble's one. (We shall see later that Weinberg's theory 
contains also some elements that are new with respect to the Kibble's proposal.) 

The homogeneity condition (|^) can be expressed by means of the Euler formula as follows 

, 9A J dA . 

'fn^^=VnTrr=A. (2) 

With the help of (||) we can deduce some general properties of the observables. Applying @) twice we find that 

OA - d^A - - » _ 

A = -^-—Ipn = V'm „ - „ , -0n = ''PrnAmn^n = {lp\A{->p , ■)p)\-)p) (3) 



and 

dA d^A 



V'„ = (i(V,^)|V'))™- (4) 



We can see that the homogeneity condition leads naturally to a nonlinear operator algebra. A matrix multiplication 
of such nonlinear operators can be easily expressed in terms of first partial derivatives of observables since 

dA QB - - 

_ = (^|i(V,V^)i3(^,^)|V)- (5) 

Olpra Olpm 

This ^-product conserves the (l,l)-homogeneity of observables because the nonlinear matrices defined in ^ are (0,0)- 
homogeneous. The (O,0)-homogeneity implies, on the other hand, that the matrices are, in certain sense, "almost 
constant". Indeed, 

/ ai , d / d^A \ ^ , d d^A 

by virtue of the 0- homogeneity Euler condition. Consider now a differentiable function t — )■ ?A„(i)- Eq. (^) and its 
complex conjugate imply that (V'l ^A('(/', '0)|V') — for any observable A. The algebra of observables equippedOB 
with the above *-product possesses a natural left and right unit element n{^, ip) = 

Despite some similarities to ordinary linear operator algebra our algebra of observables is associative if and only if 
the operators are linear. This leads to several differences with respect to ordinary quantum mechanics. For example, 
*-powers of A will not, in general, ^-commute with A, a fact that will influence the integrability of the theory. 

In linear QM the *-powers of an observable are related to higher moments of random variables measured in exper- 
iments, hence lead to the probabilistic interpretation of the theory. In nonlinear theory the product ^ leads to the 
first non-unique element of the formalism we shall meet later. 

For let us consider an observable A. We know that it can be written in a form of a matrix element where 
A is the nonlinear operator defined in (^) (for simplicity we shall, from now on, omit in our notation the dependence 
of A on The *-square of A can be defined as ^^^^ or If we want to define the third power we 

encounter several possibilities. The apparently most natural choice of {A *_A) * A oi A* [A* A) is not acceptable as 
they are in general complex and non equal even if A is real. In Subsec. II D| we shall see also that * does not lead 



to a consistent probability interpretation of A. The second definition can be chosen as {ip\AAA\il}) := A*A*A. We 
see that once we have defined A, which is unique, all its 5-powers are unique as well. Notice that since A* B ~ A*B 
both products lead to the same Poisson bracket (generate the same evolution of observables) . 

One of the most important questions of nonlinear quantum mechanics concerns a definition of observables that 
correspond to subsystems of a larger system. Let us begin the discussion with recalling the ways we do it in ordinary 
QM. 

By saying that H{'ipsuh,'4'suh) is a Hamilitonian function of a subsystem we mean that -ff (V'sub, "(Asub) describes a 
subsystem which is noninteracting and noncorrelated with the rest of the Universe (otherwise the subsystem is not 
described by a state vector). Quantum mechanically this means that the state of the Universe is \^p) — |tAsub) ^ IV'rost) 

(or p = |V^sub)(V'sub| ® Prest)- 

Subsystem observables in linear QM can be written as 

A{i:,i;) = (V-li® IrcstIV') = Trp^ubi. (7) 
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Let now {\r)} denote a basis in the Hilbert space of the "rest" and {|a)} be the one of the subsystem. A general form 
of the state of the whole system (here we assume for inessential simplicity that the whole system is in a pure state) is 



|V'>=^Va,.|a)|r)=5]|<i>W)|r). (8) 

a,r r 

An average of A in the state |^) can be expressed by means of the decomposition (^) as follows 

^(^'^) = Jm7\ = 2^- 



E 



(^1^) ^ (coM|$M) (V-IV) 



(9) 



The physical meaning of (^) is obvious: The whole ensemble of subsystems is decomposed into subensembles corre- 
sponding to different results of measurements of an observable whose spectral family is given by {|?')(?'|}, and the 
average of A in the subsystem is calculated for each subensemble separately. The form (H) is useful as an illustration of 
the "collapse" of a state vector phenomenon. Indeed, the average is calculated as though the ensemble of subsystems 
consisted of subensembles collapsed by an external observer and, in certain sense, the ensemble is treated as the one 
composed of individual objects each of them posessing some "property" measured (via correlations) by the external 
observer. 

The description in terms of the reduced density matrix psnh suggests a slightly different interpretation of the average. 
The form 

AiiP,^P)=TTpsu^A (10) 

means that we do not take into account possible decompositions of the ensemble, but treat it as a whole. The presence 
of the reduced density matrix in ( p^ ) does not mean, of course, that we cannot use the state vector's components — 
it simply restricts the form of observables to only some functions of 4'a.r- 

Quantum mechanics when viewed from the first perspective looks as a theory of individual systems but with some 
form of fundamental limitation of knowledge about states of such individuals. The second possibility turns QM into 
a kind of a "mean field theory" . Still, since both forms of description are physically indistinguishable such discussions 
whithin the linear framework seem purely academic. 

In nonlinear theories the situation is drastically changed. The nonlinearity is supposed to be fundamental, hence 
applying to each individual even if the individual is well isolated from other physical systems and no mean-field 
approximation is justified. 

The problem is practically the following. Assuming that we know a form of an observable describing a given 
subsystem itself (that is, if no interactions or correlations with the "rest" exist), what is the form of the observable if 
there do exist some interactions or correlations with the "rest" ? 

The first, explicitly non-mean-field-theoretic, guess for the subsystem's (additive) observable is suggested by the 
"collapse-like" form (^ of the average, i.e. 

A.ub(^,vi) = E^('^'^''''^^''') (11) 

r 

where A is a given function defined on a Hilbert space of the subsystem and the other definitions are like in (0). It 
is this definition that was chosen by Weinberg in his formulation of nonlinear QM. For non- bilinear A{¥'^\¥''>) the 
form and value of ( pl| ) depend on the choice of the basis {\r)}. 

Example 1 Consider the following non-bilinear observable 

where as is the Pauli matrix and \ip) belongs to a two-dimensional Hilbert space. Let the whole system be described 
by \ip) = Y.l=iY.ri'o.,r\a)i\r)2. Definition (|l]) yields 

A{Au^.,Au^) - )^ \^,^\2 + (13) 
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Let 



IV) =V'i.i|l)i 11)2+ ^2,2|2)i 12)2. (14) 

In this particular state 

A(V'sub,Visub) = IV-lll' + lfel' (15) 

so that each "collapsed" subensemble contributes to the observable separately. Let us choose now a different basis 
{|/)2} in the Hilbert space of the "rest", such that |1)2 = -^{\l')2 + |2')2), [2)2 = 75 (|1')2 - |2')2). Then 

= -L(^^^|i)^ +^22|2)i) |i'|2 + ^(Vii|l)i - fe|2)i) |2')2. (16) 
^ ' ■> ^ ' 

Substitution of |$(i'') and into gives 

IdV^llP- 1^-22 IT , IdVllp-lfen' 



A(V^sub,V.ub)-2 |^^^|2 + |^^^|2 '2 + 



iV'iiP + lfeP 



'22 I 

(17) 



which is also a sum of contributions from two subensembles, but now "collapsed to linear polarizations" . □ 
The above example reveals two properties of (|Tl|): 

1. This form is basis-dependedent so one should rather write Asuhiip, "0) = ^sub(V': "01 {k)}); 

2. If the "rest" is described by linear QM then no change of {jr)} can influece observables that correspond to "the 
rest" itself. In particular, if one considers a total energy of the whole system then a change of {\r)} will influence 
its value (we change the energy of the subsystem and maintain the energy of the "rest" ) . It follows that the 
form ( pi] ) can apply only to systems that, as a whole, are open (that is, there must exist also some another 
nonlinear system not contained in the "whole" one and compensating the changes of energy due to the changes 
of{|r)}). 

3. The "subsystem+compensating subsystem+rest" system must be described in a basis-independent way. 

The last remark indicates that in order to describe systems with "isolated" nonlinearities, that is with the "rest" 
linear, one has to describe subsystems in a way different from Weinberg's. The following modification was proposed 
by Polchinski ||. 

Consider a nonlinear observable A^ip^ip) corresponding to some isolated physical system, and assume that the 
observable can be expressed as a function of the density matrix p — \tp){il!\, i.e. A{4>,'ip) = A{p) (this is always true 
in linear QM). For example, the observable from the previous example could be written in either of the forms 



(Trpgg)^ 
Trp ' 

Tr (pcTspas) 
Tr p ' 

(Trpa3)^Tr(p^) 
(Trp)3 

or 

(Tr(pa3pa3))"+i 



Tr p(Tr pas 



|2n 



(18) 



(19) 



(20) 



(21) 
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and so on. For p = all of them, and all their convex combinations, reduce to (p^. The map p — > A{p) 

determines uniquely, but from the knowledge of A{'ip,il)) one cannot deduce the form of A{p) for p different 

from This trivial remark has important consequences. Returning to the "atom+field" example, we see that 

the knowledge of the nonlinear Schrodinger equation for the atom alone does not determine uniquely the form of the 
evolution equation of the "atom+field" composite system! The nonlinear equation describing the atom ij there are no 
correlations with the field is determined by the Hamiltonian function Hat{4'i but this function no longer describes 
the atom if some correlations with the field do appear. 

Therefore, it becomes clear that, in order to have a unique formalism, one should start from the very beginning with 
the description of observables in terms of density matrices. Such a reformulation of Weinbrg's theory was proposed 
by Polchinski and Jordan |2^,|2j] . In the second part of this paper I will generalize the density matrix formalism, so 
here we shall concentrate only on its interpretational problems. 

Example 2 Let jV') = ■(/'ii|l)i|l)2 + V'22|2)i|2)2. The reduced density matrix representing the subsystem is 

p=|^nni>i(l|i + |V'22n2)i(2|i. (22) 

The subsystem's observables are now 

(Trpa3)2 (|Viip-|^22p)' 



Trp |V'iiP + l^22P ' 

Tr (pcrapo-a) _ IV'iil'^ + IV'22 

Tr p h/)i 1 |2 -I- |?/)on 



(23) 



|^llP + |^22P' ^ ^ 



(Trpa3)^Tr(p^) ^ - |V.22p)^(|^ii|^ + ^^22^ 

(Trp)3 (|V'llP + IV'22P)3 

and 

(Tr(pa3pa3))"+i (IV'ii I' + 1^1')"+' 



Trp(Trpa3)2" (|7^nP + |V'22P)(|ViiP - \4'22Vf 



(25) 



(26) 



As we can see the observables are completely different. If \'ipii\ = \i^22\ (as in the spin-1/2 singlet state) the average 
of the first and third observables vanishes, is non-vanishing but finite in the second case, and infinite in the last one. 



Expression (|2^) seems, at first glance, the most natural one since can be rewritten as 



rest I v/ (27) 



Kibble in j|] did not explicitly define his way of describing subsystems, but the examples he discussed suggest this 
type of description. In the next section we will discuss problems with nonlocality of nonlinear QM and see that all 
the descriptions by means of reduced density matrices are more safe than the proposal of Weinberg. However, if 
some correlations between subsystems actually exist it seems we indeed can "collapse" a state of a remote individual. 
We may expect that an average value of this "remote" observable should be composed of the discussed averages of 
subensembles. The description by means of (|2^) does not seem to possess this property: It is the "average spin" that 
contributes to the observable (^3|). So we first calculate the "average" of (T3 and then square it. In the Weinberg 
description we first square the "spin" and then take the "average" . (The quotation marks are necessary because we 
have not defined yet what we mean by eigenvalues, or values of single measurements; there exists also another subtle 
point that has been ignored in the definition of (|l^) — in nonlinear QM averages of projectors, ^^I^^J^j^^ , may not 
have an interpretation in terms of probability (see the section on eigenvalues).) 

It is interesting that the "collapse" -like property of the average can be seen also in ( p^ where both values of "spin" 
are averaged separately. Finally, let us notice that the last observable ( [26| ) is unbounded, so that the values of single 
measurement cannot be identified with eigenvalues of a^.U 
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C. Dynamics 



Both Kibble and Weinberg assumed that the dynamics of pure states is given by Ha milton equations with Hamil- 
tonian functions belonging to the algebra of generalized observables discussed in Subsec. [IB . More generally, all one 
parameter flows of canonical transformations are assumed to be integral curves of the Hamilton equations of motion 
ix'jJ = dH. 

Since the evolution is, in general, nonlinear no dynamical separation of states and observables is possible (the 
ordinary Heisenberg and Schrodinger pictures can be formulated only if the dynamics is linear). Instead, the evolution 
of observables is governed by the Poisson bracket resulting from the Hamilton equations. 

The Hamilton equations in Kibble- Weinberg theory can be written in a Schrodinger-like form 

z^lV) =i?(V',V5)IV') (28) 

where H{ip, V') is Hermitian. An interesting class of nonlinear Schrodinger equations which are not equivalent to 
Weinberg's equations, even though the Hamiltonians are (O,0)-homogeneous, is given by equations of the form 

4w.(A„+«^B-<#Millrt:=H|«. ,29, 



In such a case all the nonlinear terms involving A and B cancel out in {ilj\H\ip) — {ip\Ho\ip), which, for this reason, 
cannot be regarded as a Hamiltonian function for (|29|). 

Example 3 Consider a two-level atom irradiated by an external classical light and assume that the Hamiltonian 
function of the atom takes the Weinberg form 

{^\Hl+HnlW (30) 

where the nonlinear 2x2 matrix is 

HNLmn = TTj = i^Q + ^A'L<?)mn- (31) 

d^rnOVn 

Let s{i^,tp) = (V'l^lV')- We find 









-A 








s = {s, H} = -ima, Hl + Hnl\ 

= 2{hL + hNL)^S = {ujL+UJNL)^S. (32) 

Defining 

cos ujt sin ut 
— sin Lut cos ujt 
1 

with uj being the frequency of light, we obtain after RWA the following nonlinear Bloch equations 

U — —Aw — LU^V + LO2W 

V = Au + flw + UJ3U — ujiw (33) 

W — — flv — Cl!2U + UJiV. 

Choosing (a)i, 0)2, (^s) = {—{A/2)v,{A/2)u,2ew) we get 

u = —Av — 2ewv + '^'^'^ 

i) = Am -1- f2w + 2ewu — -^''^w (34) 

r, ^2 ^2 

w = —ilv — —u — —V . 

2 2 
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Eqs. (]3J) are identical to Jaynes' neoclassical Bloch equations g] where e is the neoclassical Lamb shift and A is 
equal to Einstein's coefficient of spontaneous emission. So let us check what kind of the KW-Schrodinger equation 
can lead to the neoclassical description. 

Returning to the non-rotated reference frame we obtain 

{uJNLl,i^NL2,i^NL3) = (- (^/2)s2, (^/2)si , 2eS3) (35) 
and the nonlinear Schrodinger equation we are looking for is 

^H{i;,i,m. (36) 
If H is some (0,0) -homogeneous function then the Hamiltonian is also (O,0)-homogeneous. However 

(^i7?(v, m) = (^1 {m. m + ^^^^^^^3) 1^) (37) 

cannot play the role of the Hamiltonian function for ( |3^ ) with A ^ 0. For A = the relevant KW Hamiltonian 
function is 



and 



2 (V^IV') 

□ 



Hi^,^)^E-le^-^^. (39) 



D. Probabilities and Results of Single Measurements 

Let 7i be a finite dimensional Hilbert space and H a Hermitian operator acting in 7i. li H = {ilj\H\ip) is the 
associated observable, the values of single measurements of H can be defined in at least three equivalent ways. Since 
in nonlinear QM the three options will lead to different results we will use here different names for each of them, 
a) Eigenvalue E oi H = {iplHlip) is the number satisfying for some eigenstate the equation 

= AlV') (40) 

I— = AV^™. (41) 
01pm, 

det{H - Al) = (42) 

detf-i-^ - A<5,„„) = 0. (43) 



or 



b) Diagonal values are solutions of 



c) The third possibility comes from the fact that, since eigenstates form a complete orthogonal set of vectors in H, 
any solution of the Schrodinger equation = H\ip) can be expressed as 

/ Mt) \ ( V'i(0)e-'"i* 



(44) 



\i>N{t) I V^Jv(0)e-^""* 



and the frequencies ujn can be termed the eigenfrequencies . The three possibilities can be used for definitions of the 
results of single measurements also in nonlinear QM. 



Definitions (41) and ([43|) do not explicitly use the bilinearity of H, so can be naturally adapted also in the nonlinear 



framework. It is interesting that then they are not equivalent. 
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Example 4 The so-called "simplest nonlinearity" considered in experiments designed as tests of Weinberg's nonlinear 
QM corresponds to the following Hamiltonian function 



where Hq = . Denote '01 = \/37e*"i , ■02 = "v/^e"^ . The eigenvalue condition reads 



{ 



£^1+6 

E2 + e 



2 

-2 



A1-A2 /Al-A2^2^ 



' A2 = X\/A2 



:)]}■ 



A1 + A2 \Ai + A2^ 

For A2 = we find X+ = Ei + e, for Ai = = E2 + e. If Ai 7^ and A2 7^ we get, for e 0, 

Ai -A2 E2- El 



(45) 

(46) 
(47) 



A1+A2 



4e 



which implies {Ai and A2 are positive) 



and 



\E2-EA<A\e\ 



(48) 



Ao 



El 



-{El 



E2? 



with the eigenvector 



(1 



ic ) ^ 



(49) 



We find three eigenvalues although the dimension of the Hilbert space is two. The third eigenstate exists only for e 
satisfying the condition (^). 

Of course, all the eigenstates are stationary. The eigenstate corresponding to Aq is not orthogonal to the remaining 
two eigenstates, which excludes the ordinary probability interpretation in terms of projectors. 

Let us now calculate the diagonal values of this Hamiltonian function. The nonlinear Hamiltonian is 



R = 



El + e(8p3 _ 20p2 + 16p- 3) 8e|ViP|V'2pV'iV^2 



8611^1121^-21^^1^2 



E2 + ({-V + V + 1) 



(50) 



where the state is assumed normalized and p — IV'iP- The matrix is Hcrmitian and its eigenvalues (^diagonal values 
of H) are 



E1 + E2- 2e(8p2 - 8p + 1) 

(El - E2f ~ %t{(Ei - £2)(4p3 - 6p 

1/2 



4p - 1) 



+2e(20/ - 40p^ + 28/ - 8p + 1)} 



(51) 



The diagonal values, in opposition to eigenvalues, are in the nonlinear case functions and the number of them, again 
as opposed to eigenvalues, is always equal to the dimension of the suitable Hilbert space. 
The solution of the respective nonlinear Schrodinger equation is 



-0l(O)e"'(^i+2"<'^3>-e(CT3>^)t 
'02(O)e"*(-^2-2e(cr3)-e((T3)2)t 



(52) 



where the averages in the exponents are integrals of motion. It follows that the eigenfrequencies are also state 
dependent functions but differ from the diagonal values. 
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In linear QM mechanics observables correspond to averages hence, at experimental level, to random variables. With 
any random variable one can associate its higher moments. The higher moments, on the other hand, can be used to 
deduce a probability interpretation of the theory. Therefore, one of the essential points of any generalized algebra 
of observables is the question of the representability of higher moments corresponding to some given observable. In 
linear QM the problem is solved by the spectral theorem. 

The first interesting result concerning the nonlinear eigenvalues, noticed by Weinberg in [0, is the following 

Lemma 1 Let F and G be two (1,1) -homogeneous observables possessing a common eigenstate i.e. = fipu 
and C.C., ~- = gipk o.nd c.c., and let F ^ G = '§^^^- Then = fg'^Pk O'l^d c.c. 

The homogeneity implies that an average of F in an eigenstate is equal to the respective eigenvalue. For finite 
dimensional Hilbert spaces we know also that a number of eigenvalues is not smaller than the dimension of the space 
(a result from the Morse theory) and the eigenvalues are critical points of averages. For averages defined on the whole 
Hilbert/projective space their maxima and minima must be found at critical points (averages defined on a finite 
dimensional projective space are smooth functions defined on a compact set). These facts suggest that the eigenvalues 
are correct candidates for the results of single measurements of the generalized observables. However, the following 
examples show that the problem is not that simple. 



Example 5 The observable 



A - (53) 
(^ksl^) ^^^^ 



satisfies the Weinberg's homogeneity requirements. Its eigenvalues are ±1 hence do not bound the averages. It follows 
that such singular observables must be excluded if we want to have the probability interpretation in terms of the 
eigenvalues. The algebra of observables would have to be restricted to (l,l)-homogeneous smooth functions defined 
of the whole Hilbert/projective space. This requirement is not very restrictive. Notice that no difhculties will appear 
if we will apply to A the interpretation in terms of diagonal values or eigenfrequencies.D 

Example 6 In QM we often encounter observables whose eigenstates are degenerate. Consider now 

H^Eim+e^^^^- (54) 

Its eigenvalues are i? ± e and E. The number of them is hence greater than the dimension of the Hilbert space no 
matter how small e is. Again, no problems appear if we apply the diagonal values or eigenfrequencies interpretation. 
Had we substituted the first term with some (^^°), where Ei ^ E2, we would have obtained the third eigenvalue 
provided \Ei — £'2] < 6|e| so that the eigenvalue will not appear for sufficiently small e. This phenomenon seems to 
be related to the Kolmogorov-Arnold-Moser theorem where a dimension of an invariant torus may not change with 
nonlinear perturbation if a nonperturbed Hamiltonian system is nondegenerate and the perturbation is not too large 
PI. □ 



Example 7 Let 



mm 



where Hq — (^^°)- Two eigenvalues corresponding to the eigenstates |±) of (T3 are E+ — Ei + e and E- — E2 + e so 
are shifted by the same amount. The third eigenstate occurs if 

The third eigenvalue 

Eo ^1{E,+ E2) + eil - 2iV) (^1^) ^ (57) 

tends to Ei with N — > 00. This example is important. It shows that the ordinary projection postulate cannot be 
consistently applied, for calculation of probabilities coresponding to nonlinear eigenstates, in experiments whose goal 
is elimination of all nonlinear corrections to QM. □ 
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Example 8 Consider 



The eigenvalues are E and E + e. Probabilities calculated by means of 

(H) = {E + e)pE+e + EpE (59) 

and the normalization of probability are 

PE+e^{<y3f pE = l-{<y3f. (60) 

We know that H * H has eigenvalues {E + e)'^ and E'^, so we can calculate the probabilities by means of {H * H) — 
{E + tY'PE+c + E^pe and the normalization condition. Since 

{H*H)=E^ + (4e2 + 2Ee){cr3)^ - Se^ids)'^ (61) 

we obtain 

i4e' + 2Ee){a,)'-3e^{a,)^ 
^^+^ = 2E7T7^ (^2) 

which is, of course, inconsistent with the previous result. The troubles arise because of the nonassociativity of *. 
An analogous result was derived by Jordan p6| , who showed that propositions cannot be defined consistently within 
the *-algebraic approach to Weinberg's observables. We conclude that * cannot be applied for a definition of higher 
moments and the above lemma is useless from the viewpoint of the probability interpretation. □ 

Next candidate for a single result of a measurement of some H is the diagonal value. We have already seen that the 
diagonal values are state dependent functions. There is no general guarantee that a diagonal value of an integral of 
motion is itself an integral of motion. In fact, as the Weinberg equations are not necessarily integrable for dimTi > 2, 
whereas the number of the diagonal values is equal to the dimension of TC, the diagonal values, for N > 2, can be 
time independent only for integrable systems. In addition, as diagonal values are roots of algebraic equations, they 
even do not have to be differentiable functions of states. Therefore, we cannot find for them any general equation of 
motion. 

In spite of these disadvantages the diagonal values are in some respects superior to eigenvalues. If for two observables 
A and B the commutator AB — BA = 0, then A and B can be diagonalized simultaneously and the product AB has 
eigenvalues being products of the eigenvalues of A and B. This leads to the following unique probability interpretation. 
Let U {tp, ip) be the unitary transformation diagonalizing H. Then 

i7-^i?„(V',V;)|(C/^)„p. (63) 

n 

The functions p„ = | ([/V')nP/(V-'l'0) Biie probabilities resulting from the higher moments procedure. 

Let us recall that for two observables F * G — F*G. The nonassociativity of * is reflected in the *-algebra of 
observables in the following non- uniqueness of *. Although always 



but for non-bilinear observables 



{tlj\{m...*H)\ijj) = {ijj\H . . . H\'4>) (64) 



[H* ...*H) ^ H ■ H. (65) 



The eigenfrequencies approach can be applied only to systems that are close to integrable [ p5[ | . For finite dimensional 
Hamiltonian systems some general KAM theorems are known. It is known that components of state vectors exhibit 
in such a case, unless some resonant initial conditions occur, a quasi periodicity described by 

^fe(^) = X! Cfc(ni . . .7iAr)e~*S-"'''^"* (66) 

n 1 . . . n iv 

where both the amplitudes and the frequencies are dependent on initial conditions, as shown in our first example in 
this subsection, and the sums run over all positive and negative integers. The homogeneity condition implies that 
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(V'iv^) = EEi^'=("i---)i' (67) 

k rii... 

and 

fe m... u 

These formulas lead to the natural interpretation of the eigenfrequencies n^uoi, as the results of single measurements 
and the respective normalized coefficients as probabilities. The interpretation can be naturally extended to other 
observables by assuming that all observables generate one parameter groups of canonical transformations via the 
Hamilton equations of motion. The same result would be also obtained by means of the "standard" theory of 
measurement where we assume that the so-called pre-measurement is described by an interaction Hamiltonian function 
which is proportional to the observable measured, and that all the measuring procedures are based on measurements 
of observables that are bilinear like in linear QM. To be precise, we must remark that this procedure (chosen finally 
by Weinberg) is not unique either because there is no unique description of the composite "object -|-observer" system 
even if the observer is assumed to be linear (see remarks in the section devoted to observables). Another difficulty 
is that the assumption about measurements based only on bilinear observables cannot be formulated in a relativistic 
way. In standard measurements one measures positions or momenta. A theory where momenta are always linear 
observables whereas the Hamiltonian function is a nonlinear observable cannot be relativistically covariant. Perhaps 
it was this point that made Weinberg admit that he "could not find any way to extend the nonlinear version of 
quantum mechanics to theories based on Einstein's special theory of relativity" [^. My own proposal presented in 
part H avoids this problem, because I introduce the nonlinearity through entropy, which is not an observable, hence 
without any modification of the algebra of observables. To better understand mutual relations between eigenvalues 
and eigenfrequencies it is interesting to compare the two notions in situations where a number of eigenvalues is different 
from this of eigenfrequencies. 



Example 9 We know that 



H = i?(V.|V^)+e*#!. (69) 



has three eigenvalues E ± e and E. The eigenfrequencies are 

Ei{'iP,^) = E + e{3{c73f-2{a3f) (70) 
E2{^P,^) = E + ei-S{a3f -2{a3f) (71) 

The three eigenvalues correspond to eigenvectors satisfying 

( for ?/'° 

(ffs) = < 1 for V'' 

[ -1 for V^-i 

In this notation 





E 


with probability pi^ip'^ ,tp^) = 


1/2 


£;2«,^°) - 

Eii^-\i;-^) = 


E 


with probability p2{ip^, ij^'^) = 
with probability piiip^, = 
with probability p2{ip^, v) = 


1/2 


E + e 


1 


E-5e 





E + be 


with probability pi{ip~^, 4'^'^] 


= 


E^{4,-\i>-^) = 


E-e 


with probability P2{4^^, 4'^'^] 


= 1 



As expected even in an eigenstate there are two eigenfrequencies. The ones which are not equal to the eigenvalues 
occur with probabilities O.D 

A reader of the main Weinberg's paper may be a little bit confused with what is finally understood as a result of 
a single measurement. A half of the paper suggests that this role will be played by eigenvalues, then probabilities 
are defined in terms of eigenfrequencies and, finally, in the analysis of a two-level atom a difference of eigenvalues is 
treated as the energy difference while the difference of the eigenfrequencies is treated as the frequency of the emitted 
photon. It seems that from the viewpoint of the analysis above the eigenfrequency difference should be treated as the 
energy difference of atomic levels. This is one of the subtleties that have been omitted in the analysis of experiments. 

I think that each of the three possibilities lacks elegance and generality, and this is one of the reasons one should 
look for a possibility of introducing nonlinearities without altering observables. 
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III. COMPOSITE SYSTEMS AND NONLOCALITY 



One of the most often quoted "hiipossibihty theorems" about nonhncar QM is that any such theory must imply 
faster than light communications. This statement is evidently too strong. In fact, we will show in part II that there 
exists a rich class of theories where the mentioned phenomenon does not occur. As we shall see the theories must 
satisfy two conditions. 

1. Their interpretation must not be based on the "collapse of a state vector" postulate. It means that any 
reasoning based on this postulate has to be regarded as unphysical. (In linear QM there exist such limitations. 
For example, all "counterfactual" problems like EPR paradox or Bell theorem can be eliminated trivially by 
rejecting reasonings involving alternative measurements.) Interpretations of QM that do not introduce the 
collapse (projection) postulate exist, to mention the many worlds one ||2^]. 

2. Observables correspondin g to subsystems must be functionals depending on density matrices of those subsystems 
(the Polchinski postulate [p3| ). 

In the second part of the paper I shall prove some general, quite strong theorems related to the latter condition. 
Here, we shall limit ourselves to several simple examples showing different ways of making quantum mechanical 
nonlocality "malignant" , to use the marvelous phrase of Bogdan Mielnik. 

It seems it was Nicolas Gisin who was the first to observe that a nonlinear evolution can lead to a faster-than-light 
communication between two separated systems. His argument was the following p9[ |. Let 7i be a finite dimensional 
Hilbert space, {ipi), (z Ti, i = 1, . . . ,n, j — 1, . . . ,m, {^ilipj) — — Sij. Then the following lemma holds. 

Lemma 2 If for souic noTivanishing probabilities x^, yj 

^x,|7^.)(V;.| = 5]y,#,>(0,| (72) 

i j 

then there exist two orthonormal bases {\ai)}, {|/3j)} in some Hilbert space Ti! and the state 

i 3 

□ 



The proof can be found in |29|. The meaning of the lemma is that the two decompositions of the density matrix can 
be obtained by means of the EPR correlations. Indeed, we can take the density matrix |x)(xl s-iid trace out Ti' . The 
resulting density matrices are these appearing in the lemma. Assume now that we have a nonlinear evolution of pure 
states 

mi-^^^gtmm). (74) 

Then, in general, 

Y^.gtm{i^.\) ^Y.yM\<t^3){'i^3\) (75) 

i 3 

even if initially the two decompositions were equal. The important assumption leading to the faster-than-light 



communication is that each of the pure state sub-ensembles evolves according to (74), even if the whole state is 
given by their convex combination. This seems reasonable if we assume that the ensemble consists of the collapsed 
sub-ensembles. If we apply some "no-collapse" interpretation then the argument cannot be consistently formulated. 
However, it has to be stressed that the form of the evolution appearing in ( [ts] ) can be derived from quite general 
assumptions. It was shown in a rigorous way in the language of theory of cathegories by Posiewnik Jsot that this form 
is implied by Mielnik's definition of mixed states as probability measures on the set of pure states. This remark has 
quite nontrivial consequences: The description of mixed states in terms of probability measures (Mielnik's "convex 
approach" ) combined with putative nonlinearity of the Schrodinger equation leads to faster that light communication. 
If one wants to get rid of such difhculties one cannot keep the figure of states convex. 

Gisin, in his second paper [3l| ] considered an example of a nonlinear evolution taken from the Weinberg paper 
10. Consider an ensemble of pairs of spin-1/2 particles in the singlet state and assume that in one arm of the 
experiment the evolution is given by the Hamiltonian function (■0|tT3|'!/')^/(V'|'/')- An experimenter in the other arm 
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chooses between two settings of his Stern-Gerlach device and decomposes, by means of the EPR correlations, the 
ensemble in the other arm into subensembles corresponding to, say, spins up or down in the z direction, or spins up or 
down in some u direction tilted at 45° with respect to the z-axis. The evolution equation, up to an overall phase, is 
given by i^lV') = 2(cr3)cr3|?/')- If the initial state is either up or down in the z direction the evolution is stationary and 
the average of is always zero. If the state is either up or down in the u direction the spin is precessing around the 
z axis but the sense of the rotation is opposite for the up and down states. After a time of a quarter of the "Larmor 
period" the spin will have the same positive value of ((T2). And this result can be detected by an observer in the arm 
with the nonlinearity. 

As we can see Gisin in both of his proofs ignored details of the description and the evolution of the "large" system. 
To be precise he should rather define a Hamiltonian function of the two arms, then solve the nonlinear Schrodinger 
equation for the whole system or calculate the evolution of the average of 02 in one arm. This is the crucial point. 
We know already that the description of composite systems is not unique as long as we know only the evolution of 
pure states of subsystems. It will be shown below that the Gisin's telegraph will never work if we describe the whole 
system in the way proposed by Pochinski. It will be shown also that the proof is valid for the specific choice of the 
description proposed by Weinberg. 

But before we shall pass on to the details of the calculations let us consider another "general proof which was 
formulated by myself independently and simultaneously with this of Gisin ||3^ . Consider a polarizing Mach-Zehnder 
whose first beam splitter is an analyzer of circular polarizations (for a detailed description of the polarizing Mach- 
Zehnder interferometer see [^). A reflected beam is righthanded and the transmitted one — lefthanded. The 
lefthanded beam passes through a half-wave plate and then the two beams are phase shifted by a with respect to 
each other and recombined. 

If a source produces a linear polarization state, say 

lV')-^(l+> + |-». (76) 

the interferometer transforms the whole state into 

l'/'') = ^(^(e'" + l)|+) + (l-e-)|+)). (77) 

and we observe the interference between the two outgoing channels. 
If, however, the source produces a singlet state fsl] 



the whole state transforms as follows 



->|-», (78) 



|->l+> + ^ ^ 

and there cannot appear the interference in the interferometer (subsystem I): The linear dependence of the states 
which interfered in the linear polarization case is destroyed by orthogonality of their singlet state partners. The lack 
of the interference in I is therefore a nonlocal phenomenon. 

We know that nonlinear evolutions in a Hilbert space do not conserve scalar products (the "mobility phenomenon" 
p2[) and states that are initially orthogonal may loose their orthogonality during the course of the evolution. It is 
clear that if we will violate in some system II separated from I the orthogonality of |-t-) and |— ) because of some 
nonlinearity then the photons in I will start to interfere and the interference will be the stronger the more violation of 
the orthogonality has been obtained. Putting it differently, tracing over a space where the evolution does not conserve 
scalar products may result in some "remains" of the traced out system in the reduced density matrix describing the 
remote, separated system. 

This argument looks general but a careful reader has probably noticed that one additional assumption has been 
smuggled here: It is implicitly assumed that a nonlinear evolution in H can be extended to Ti®Ti' in such a way that 
a solution describing the composite system is of the form 

10) ® 1+) + W) ® h) (80) 

where and \(t)') are solutions of the subsystem's nonlinear evolution equation that are "in mobility", that is whose 
scalar product is not conserved. This assumption is a strong limitation. In fact, it never occurs if separated systems 
are described in a Polchinski's way. However, the proposal of Weinberg does allow for such pathological solutions. 
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In what follows we shall formulate the arguments of Gisin and myself in a precise way within the framework of 
Weinberg's approach. 

Let us assume that we have two separated systems, I and II, described by Hamiltonian functions 

H,{^,^)^E,{^\^), H2{x,x) = E2{x\x)+e^^^^^. (81) 
According to Weinberg, the Hamiltonian function of the whole I+II system is given by 

^^(V',V^) -5I^i(^W'<^w) + II^2(X(fe),X(fe)), (82) 
/ fc 

where ipk{i) = Xi(k) = ^ki- 

A general solution of the nonlinear Schrodinger equation corresponding to (182) takes the required form 



l^> = 16) ® \4>) + 16) ® (83) 

where = (q),|'^2) = (q)iQ^ = exp(— ii?it) and |0) = (pilV') ~ (^/) are some solutions of the single particle 
nonlinear Schrodinger equation. 
The Hamiltonian functions 

Hi{il),il))=^Hi{Lp(i),Cp(i)) and H2{^,4') = Y^H2{x(k),X(k)) (84) 

I k 

commute because Hi{ip,ip) — Ei{ip,^p). Bearing in mind that there is no interaction part in H, we conclude that 
there is no flow of energy between I and II. 

Let us now calculate the reduced density matrix of the linear subsystem I. With the notation of (^3|), we find 
Pi = 5(I6)(6I + + (V'|0)I6)(6I + (0IV')I6)(6I)- The off-diagonal elements vanish in the linear theory (e = 0) 

if initially the states are orthogonal, and we get a "fully mixed" state. For e 7^ these coherences oscillate with the 
mobility frequency. There therefore exist observables whose average values oscillate in this way. For example, the 
components of spin satisfy (tri) = ((T3) = 0, but ((72) is proportional to sin(4e((T3)i) hence depends on the parameter 
characterizing the nonlinearity of the other, separated and nonintercating system. 

Let us complete the analysis of this part with two comments. 

1. The telegraph is based on the fact that an average of an observable in the linear system I depends on parameters 
of the Hamiltonian function of the nonlinear system II. Recalling the form of the Poisson bracket equation for 
observables related to I = {Ai,Hi + H2} we can see that such a dependence is possible if and only if 
{Ai,H2} 0. It is an easy exercise to check that this condition is met indeed in the case of A = (V'|o'2|^) 
and H2 from (js^). It follows that the necessary an sufficient condition for elimination of telegraphs based on 
the mobility phenomenon is that any observables corresponding to separated systems must be in involution with 
respect to the Poisson bracket generating the evolution of observables. The observation that such observables 
are not necessarily in involution in the Weinberg approach is due to J. Polchinski psf . 

2. If one of the systems is linear, then the telegraph based on the mobility phenomenon can be used for sending 
information only from the nonlinear system to the linear one. In Gisin's telegraph one utilizes the remote 
preparation of mixtures entering the nonlinear system. Accordingly, this kind of telegraph works in the opposite 
direction hence cannot be equivalent to this based on the mobility phenomenon. 

Let us describe now the Gisin's telegraph more precisely | ]35| . Consider again the same Hamiltonian functions of 
the subsystems and the Weinberg's description of the whole one (Eqs. ( ^l| ) and (|8^)). 

The sender is free to choose a basis in his Hilbert space by a rotation of his Stern-Gerlach device. Let the unitary 
matrix with unit determinant 



a (3 
-P a 



(85) 



describe the freedom in the choice of bases in the linear subsystem and the relation between the components of the 
whole state in the chosen basis and in the spin up-down one is given by 



■^21 V'22 / V -/3 a y I -0-+ 



(86) 



15 



The state is initially the singlet, which means that 



^ii(O) Vi2(0) a P\{0 l\^J_{-f3 a 

V'2i(o) ^22(0) V2 V-/^ "/ V-1 0/ V2V-"-/3 



(87) 



Finally, the solution for the whole system is 



where X — |/3p — jap. The reduced density matrix of the nonlinear system II reads 

= ^1 + Re(a/3) sin(4e(|a|2 - |/3n<)CT2 + Im(a^) sin(4e(|ap - \/3\^)t)ai. (89) 

The average of (T2 in the nonlinear system is 

(era) = 2Re{aP) sin(4e(|ap - \P\'^)t) (90) 

hence depends on the choice of basis made in the linear one. 

Notice that since the Hamiltonian function of the linear subsystem is proportional to (V'lV') it is in involution with 
any observable. In particular {(0-2), -ffi} — which means that the commutability of observables corresponding to 
separated systems is not a sufficient condition for the nonexistence of faster-than- light telegraphs. The dependence 
of ((72) on a and (3 follows from the dependence of H2 on these parameters. We have remarked already in the section 
dealing with observables that the Weinberg's choice is basis dependent: By a change of basis in I we can change a 
value of energy in II. 

It becomes clear now under what conditions this kind of pathology can be eliminated. Consider a density matrix 
p describing a "large" system. A change of basis in a subsystem I is represented by the unitary transformation 

Ui <»liipU]^^ <»lii. (91) 

Any function of p that can be written in a form of a series 

k 

transforms in the same way. It follows that all expressions like 

TT{f{p)li®Aii) (93) 

are {//-independent. Also all functionals depending on reduced density matrices of subsystems are independent of 
changes of bases outside of those subsystems. In the second part of the paper I shall prove a general theorem stating 
that if two observables depend on reduced density matrices of different subsystems then they are in involution with 
respect to a large class of (Poisson or generalized Nambu) brackets. In such a case both kinds of telegraphs will be 
eliminated. 

The last point that has to be explained is the question of uniqueness of the dynamics of subsystems. We have 
shown already that a knowledge of observables on pure states of subsystems does not determine their form if the 
subsystems are correlated with something else. Here we will show that observables that differ only on mixed states 
will, in general, generate different evolutions of subsystems. In the "atom+field" case the evolution of the atomic 
inversion will depend on our (arbitrary) choice of the description. 

Consider our "canonical" example (81) but rewritten in a form including density matrices. For the sake of clarity 

let us also use the more general matrix e = ( ^ ^ ] istead of sfeaz- 

We have now an infinite number of possibilities. The simplest nontrivial ones are 

ffi(pi) = £;iTrpi, i/2(p2) - i^2Trp2 + ^^^^^ (94) 

lrp2 



and 



ffi(pi)=£;iTrpi, F2(p2) = i?2Trp2 + ^^^^^. (95) 

Trp2 K^^PiY 
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Let pki = Ylm'^Prnki'ml denote the components of the reduced density matrix p2. The Hamiltonian function of the 
whole system is 

H{p) = Hiipi) + H2{p2). (96) 
The two forms of H2 lead, respectively, to the following two evolution equations 

— Pfcz = -2z^j^Pfci(efe - ez) (97) 

and 

d ^TTp2i Tr(pi) 

—Pki = -2^7- — r^pki[ek - ei) (98) 

dt Trp2 (Tr/92) 

where the expressions involving traces are integrals of motion. The equations are different. 

There is only one way out of the above dilemma: The description must from the outset be given in terms of density 
matrices. 



IV. A TWO-LEVEL ATOM IN NONLINEAR QM 

The aim of the analysis below is to clarify some elementary features of the nonlinear formalOAism involved in 

calculations of optical phcncomcna in two- level atoms. The contents of this section should not be understood as a 
complete, unique solution or systematization of all the questions encountered. Still, I hope that I have managed to 
point out some elements essential for correct computations in practical, experimental situations. 

In linear QM a two- level system is mathematically equivalent to a spin- 1/2 nonrelativistic particle. All the examples 
discussed in this work were based on a two-dimensional Hilbert space and the reader may have a feeling that they 
could be applied equally well to both spin-1/2 particles and two-level atoms. This kind of conviction has been shared 
by all the authors dealing with th(X)r(rtical and experimental aspects of nonlinear QM. The main result of this section, 
as we shall see later, is that, paraphrasing G. Orwell's words, in nonlinear QM all two-level systems are two-level but 
some of them are more two-level than others. 

We shall assume that the putative nonlinearity is of purely atomic origin. This means that we shall consider the 
atom as a nonlinear subsystem of the larger "atom+field" system where both the field and interaction Hamiltonian 
functions are linear in density matrix. (The papers discussing the problem can be divided into two groups: Either 
the authors do not care about the description of the "atom-|-field" composite system or treat it in the way proposed 
by Weinberg. We know that none of them can be correct unless the atom and the field are in a product state, which 
is typical for semiclassical treatments. It follows that no really quantum description of the problem has been given 
as yet.) We shall assume also the dipole and rotating wave approximations. To clarify the role of the latter we shall 
briefly treat both Am = and Am = ±1 cases. 

We begin with the form of the atomic Hamiltonian function. The simplest one (at lest from the point of view of 
simplicity of calculations) is 

H^^[p]=T,pHL+^^^^ (99) 
ir p 

where Hl is the linear Hamiltonian of the atom and e is an operator commuting with ■ Assuming that we consider 
the atom in a pure state p= we find that a general solution of the resulting nonlinear Schrodinger equation is 

(in this section we shall use the ordinary units with h ^ 1) 



Mt) = MO) exp[-^{Ek + 2{e)ek - {ef)ty (100) 



The averages of e are integrals of motion and, of course, depend on all nonvanishing components of \ip). This is 
an important point. In the analysis of a coupling between the atom and an external electromagnetic field we meet 
two difficulties. First of all we have to decide which states will be involved in the absorbtion-emission process. In 
linear QM the situation is simple: We take two stationary states of the noninteracting atom. In nonlinear QM the 
atomic nonlinearity may lead to stationary states that are not orthogonal between one another. Atomic creation and 
annihilation operators corresponding to such levels cannot satisfy ordinary anticommutation relations. 
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To avoid such comphcations the interaction term we shall choose will be defined in the ordinary way, that is, in 
terms of creation and annihilation operators corresponding to the levels of the linear Hamiltonian. 

Second, we cannot a priori restrict the atomic Hilbert space to two dimensions because a k-th eigenfrequency 
depends on amplitudes of all other components of lip) and the evolution cannot be naturally "cut into iV-dimensional 
pieces" . Of course, we cannot also assume that only these are nonvanishing which we are interested in (a probability 
that in a given case we will find just those only nonvanishing ones is 0). Therefore, to make the analysis perfectly 
consistent we should give up the two level approximation in the interaction term. We shall not, however, consider 
such complications although this approximation will further restrict the physical validity of the calculations presented 
below. 

Let bk, fcj. be the fc-th level atomic annihilation and creation operators satisfying the fermionic algebra [bk,bj]^ = Ski 
and a, be the annihilation and creation operators of a monochromatic photon field whose frequency is uj. The choice 
of the creation-annihilation operators language leads naturally to the following Hamiltonian function of the whole 
"atom+field" composite system 

HA+Fiip,i^) = {ip\ ^ h.ujkh\bk + fu^a^a + ^{b^bia - a^b\b2)\^) 



k 



i^iEk^kbik 



k\ 



The state in the Fock basis is \ip) = X^fen '4'kn\k)\n) . The nonlinear term is therefore equivalent to 



(101) 



(102) 



which seems natural but, as we have seen before, is only one out of the whole variet y o f inequivalent possibilities. 
Such a description is consistent with the definition of the atomic Hamiltonian function ( |99| ) (in this sense it is unique) 
and free from any "malignant" nonlocalities. On the other hand it is in a mean-field style; we have remarked in the 
section on observables that 

T^'P^^^P^T^ (103) 

Tl PAT 

would look more "fundamental" . Anyway, although the conclusions drawn on a basis of such an analysis are limited 



in their generality, some choice has to be made and calculations with (103) would be more complicated. 
The Hamiltonian resulting from (|101|) is 



Ha+f = TT,uJkb\bk + huja)a + —^{b^bia — a'b\b2) 



and the nonlinear Schrodinger equation is 

2 

--?i(JV^V2,n-l 



(104) 



+ -hq\/n + 1-01, „+i 



'Elm IV'Jmp V Elm IV'/ml 

forfc>2. (105) 



18 



Writing ■0fen = ^kn exp(— iafc„/?i) we find that for fc > 2 Akn = const for all n. Since also \\ip\\ is time independent 
(hereafter we put \\tp\\ = 1) it follows that for > 2 the exponents depend on time also via (V'|ei2|V') = + 
e2|V'2nP) whose explicit form has to be determined. Decomposing 

W^iM = Im^i + e2Kb% + blb^M + i(^|(ei - e2){b% - b%)\^) 

= : (ei + e2){moW + (^2 - ei)(V|i?3|V'), (106) 

where the first expression is an integral of motion, we see that the problem reduces to calculating {^plR^lip) which is 
one half the atomic inversion. Denoting e = Ip, the total Hamiltonian can be decomposed now into two parts 

Hi = h{uj2 - uji)R3 + 2(62 - ei){ip\e\ij)R3 + fiua^ a 

+ ^ib\hia-a^b%l 

H2 = n{uji + uj2)Ro + ^"^kblbk + 2(?M£|V')((ei + e2)i?o 

fc>2 

-Y,heublbk)-{m^?- (107) 

fc>2 



Operators, -R3, Ri = 5(^2^1 + b\b2), R2 = §(^^2^1 + ^1^2), a and commute with H2 so that the evolution of the 
atomic operators Rj is generated by the following nonlinear generalization of the Jaynes-Cummings Hamiltonian 

Hi ^ hujoRa + 2eo{ij\e\Tp)R3 + huoa^a + ^-^{R+a - a)R^), (108) 

where loq = L02 — toi and eo = £2 — ci. To explicitly distinguish between initial conditions and dynamical objects we 
shall decompose {^\e\ip) as follows 

(V'|e|^)= J2 ^kHkn\^ + iei+e2){MRo\^)+eo{mM -^A + eoimM (109) 

k>2,n 

where A is a constant depending on initial conditions. Denoting further hojQ + 2eoA — hujQ, 2t^ — fie we finally obtain 

Hi = huj'f^R^ + ne{ijj\R3\ijj)R3 + hujo^ a + ^^{R+a - a)R^). (110) 

It is clear now what is the actual meaning of the two-level approximation in nonlinear QM. The evolution of the 
atomic operators Rj is generated by a two-dimensional Hamiltonian in analogy to the linear case, but the parameters 
of Hi depend on components of the wave function corresponding to the levels being outside of the two-dimensional 
Hilbert space. On the other hand, the phases of the remaining components depend on the average of the atomic 
inversion of the two levels. 

(It should be stressed that one can consider a quantum system whose Hamiltonian function is a sum of a linear 
term and some nonlinearity which involves only spinor components of the wave function. Formally, such a case would 
be equivalent to a composite system whose constituents are a scalar particle that evolves according to the laws of the 
linear QM and some nonlinear two-level system noninteracting with the particle. We know that various solutions of 
the nonlinear Schrodinger equation, including the product one, will exist and no components other than the spinor 
ones will be involved in the nonlinear part of the evolution. Systems with nonlinearities of this kind would be "truely 
two-level" as opposed to the systems which are two-level in the sense specified above.) 

Our next task is to find and solve an equation for the atomic inversion. The Poisson bracket evolution equation 
reads 

inj^{^R3\^) = mR^.HiM = ^{^R+a + a^R^^) (111) 
so is just like in linear QM [^. Following the notation of ||3^ the second derivative is found equal 

^(V'|i?3|V') = V(V'|i?3(iV + i)|V') 

+ ^(A' + e{ij\R3\^)){^\R+a - R^\^) (112) 
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where N = + a^a and A' = uj'q — oj. Define 



B= hcUk\An\^ +HuJi+UJ2){i^\RoW- (113) 



fe>2,n 



('0|7V|^/'), hke in the hnear case, is constant. In order to get rid of the average in the last row of (112) we rewrite the 
whole Hamiltonian function as follows 

Ha+f = hujA+F = A^+B + huj{^P\N\i^) 

+?iA'(V'|i?3|V') + eo(^l^3|^)' + '-^{m+a - a^R-\i^). (114) 
Denoting w — 2{-ip\R3\ip), uia — A'^ /fi, = B/h, llirwa — ^a+f — we finally get 

■ii) = 2A'{ujRWA -uj{N) - loa) 

+ {e{ujRWA - uj{N) - uja) -A')w~ q^{^\R:i{N + 

--eAV--u;3. (115) 
4 8 ^ ' 

We have met here the characteristic inconvenience of the Poisson bracket formalism of nonlinear QM: The nonexistence 
of the Heisenberg picture. In the linear case we can solve the Jaynes-Cummings problem completely, independently 
of any particular initial conditions for states. Here the term {tl)\Rz[N + involves correlations between the atom 

and the field and I have not managed to express it solely in terms of constants and w unless the state is an eigenstate 
of TV, or a semiclassical decorrelation is assumed. So let initially the state of the system be a common eigenstate of 
i?3 and a with respective eigenvalues n' and n. The atomic inversion satisfies then the general elliptic equation [3^ ] 

w = 2A'(A'n' + \e) + fe(AV + \()- A' -^{N + \)]w - 7eAV - . (116) 
8v 8 22/4 8 



Although we could try to find a general expression for the atomic inversion following from (116) it seems more 
instructive to make here some simplifying assumptions. First of all we can take the "two-level initial conditions" , that 
is, assume that the initial state of the system is such that the only nonvanishing components of the wave function are 
these with k ^ 1,2. Then hA' = hA + ~ where A — ojQ — Lu. (Let me remark here that in most of the papers 
dealing with two-level systems (cf. [|[Q]) their authors assumed that for the "simplest" nonlinearities €2 = which 
seemed to suggest that the nonlinearity must shift the resonant frequency. As we can see, the more symmetric "cts" 
choice does not change the detuning.) Further, choosing the "detuning" A' = and denoting e^/8 — 2<;^ we get 

w = {2(.^ -n^)w -2<;^w^. (117) 

This equation can be solved immediately. For example, with the initial condition w(fi) — ~1 we find 

( -~cn{nt,<;/n) for f7 ><r 
w{t) = I -sech(ni) for Q ^ <; (118) 
[ -dn{<;t,n/<;) for n< <; 

The result is analogous to this of Wodkiewicz and Scully ||] who chose the Bloch equations approach. 

It is an appropriate point for a brief comparison of our results with those of Weinberg who chose his own, basis de- 
pendent description of the "atom-|-field" system. Consider the same form of the atomic nonlinearity. The Hamiltonian 
fimction of the composite system in the Fock basis (this basis was chosen by Weinberg) is 

Ha+f{'4),tP) = {i>\ Y^^kb\bk + fu^a^a+ ^(4^ia - a}b\b2)\ii^) 

(119) 



E 



where P„ project on n-photon states. Assume now that initially the state of the whole system has only one nonvan- 
ishing component V'li (one photon and the atom in the ground state). It follows that only ■01 1 and ^"20 will appear 
in the nonlinear Schrodinger equation. Our "nonmalignant" choice leads to the Schrodinger equation of the form 
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1 



-n#n, (120) 



while the Weinberg's one leads to 



2 /^J,/„J2,2 



-^?i#20 



1 



(121) 



Simplifying the fractions we obtain 



ihipii = {hull + - ^?i#20 

ihip20 = [hL02 + el)ip2a + -jj^qi'ii (122) 

which are linear and the only modification with respect to ordinary QM is that the energy levels of the atom are 
the nonlinear eigenvalues corresponding to the atom in the absence of radiation. In addition, the "(J3" nonlinearity 
satisfies €\ = e\ so that for this choice of the "simplest" nonlinearity neither the energy difference nor the shape of the 
inversion's oscillation would be affected, whereas we know already that the "correct" Polchinski's description leads to 
elliptic oscillations even for w(0) = — 1 and N + \ = 1. 

This result explains the difference between the calculations of Weinberg and those of Wodkiewicz and Scully. Both 
of them are based, more or less implicitly, on different assumptions about the description of composite systems. 
Nevertheless, it is clear that had we chosen some other "correct" form of the total Hamiltonian function, we would 
have obtained some other solution for the atomic inversion. 

In the calculations in this section we have not needed any assumption about the "smallness" of the nonlinearity. 
Moreover, as we have shown before, it is not evident what should be actually meant by a small nonlinearity. This lack 
of uniqueness is related to the fact that there exist singular nonlinearities that are negligible in the lack of correlations, 
but can become dominant if the nonlinear system in question correlates with something else. Anyway, even ignoring 
these subtle points it seems reasonable to expect that a physical nonlinearity, if any, should be in ordinary situations 
small in some sense. Therefore, it becomes interesting to understand in what respect the solutions we have found 
depend on approximations. In particular the role of the rotating wave approximation should be clarified. 

The easiest way of doing that is to consider transitions with the selection rules Am = ±1 involving circularly 
polarized light. It can be shown easily that the only difference with respect to the Am = transitions discussed above 
is the necessity of substituting qq for in the equations for w, so that no qualitative change in the time dependence 
of w will appear. 



V. WIGNER'S FRIEND REVISITED 



One of the essential elements of the argumentation against the description of composite systems proposed by Wein- 
berg was its dependence on a particular basis in a Hilbert space. In ordinary quantum situations such a dependence 
is difficult to accept. However, the situation changes if we enter the domain of the measurement theory. Notice, that 
if the role of the composite system is played by an "object" and an "observer", and the nonlinear evolution occurs 
in the state space of the observer, then there exists a priviliged basis in this space, the pointer basis [ psf , and this 
is the basis that has to be chosen for calculation of averages of observed quantities, unless one uses the many- worlds 
interpretation, which does not select any basis. 
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The idea that the domain of (coscious) observations is a natural arena for a nonhnear evolution of quantum states 
was introduced by Wigner in his paradox of a friend ^] . Wigner considered a physical system that can be described 
as "(observed object+ observeri)+observer2" . The hypothetical nonlinearity was to make the off-diagonal elements 
of the density matrix of the "object -|-observeri" system disappear, since once the result is consciously observed, no 
further interferences should be possible. Such a goal can be acheived either by a "collapse" of a state vector, or 
in some "no-collapse" way (decoherence approach). The sugestion of Wigner, contrary to the oppinion of Penrose 
expressed in |^^, belonged to the latter class of theories. 

A modification of the observed (linear) system can exert an influence on the observer; the Gisin's telegraph belongs 
to such a class of phenomena, so may not be very pathological in the context of measurements. On the other hand, 
the telegraph based on the mobility phenomenon would lead to telekinetic phenomena [Q. Indeed, consider a pre- 
measurement that produces an entangled state of a system and an observer. Let the observed system be a two-level 
atom and an electromagnetic field, initially in the state 

|1)|1) + |2)|0) = |11) + |20) (123) 

where the notation is analogous to this from the previous section. Assume that a Hilbert space of the observer is also 
spanned by two states |±). After the pre-measurement the state of the joint system is 

|ll)|+) + |20)|-). (124) 

If now the observer's state space undergoes a nonlinear evolution with the "(T3" nonlinearity like in ( |8l| ) then two 
possibilities occur. First, if only the |±) start to rotate with the mobility frequency then the atomic reduced density 
matrix is like in linear QM. However, if the nonlinearity violates the orthogonality of the states |l)|-l-) := |l-l-) and 
|0)|— ) := |0— ) (i.e. when the observer makes both his own consciousness and the interacting photons evolve in a 
nonlinear way) then the atom starts to "feel" it. It follows that, at least in principle, a suitable form of his "own" 
nonlinearity can enable the observer influence in a statistically observable way a behavior of a random generator just 
by watching it! Notice that there is also some limitation on the possibility of the influence. For consider a situation 
where there is a number of intermediate states between the random generator and the observer, so that the entangled 
state takes the form 

|l)| + l)|+2)...|+„)|+) + |2)|-l)|-2)...|-„)|->- 

The random generator will "feel" the mobility only provided the mobility will involve the underbraced states 

|1) 1 + 1)1+2)... |+n)|+)+|2) 1-1)1-2)... |-„)|->. 

In case the nonlinearity is more localized, say 

|l)| + l)|+2)...|+„)|+)+|2)|-l)|-2)...|-„)|-) 
^ ^ ^ ^ ^ ^ 

then the generator's density matrix does not contain the "malignant" terms (one cannot influence the generator by 
watching it on TV; or, it is much easier to influence with my thoughts my own finger than someone else's). 

The above phenomenon is present only if we assume the kind of description a la Weinberg. One may hope that 
at least in the "correct" description no ways of exerting observer's influence on external world by "thoughts" or 
"intentions" exist. The following surprising example is a slightly modified version of the phenomenon noticed by 
J. Polchinski J 23| . This is the only paradox that I was not able to eliminate in the Nambu-like generalization of QM 
proposed in 

Consider a process involving four steps. (1) A spin-1/2 ion enters a Stern-Gerlach device, which couples to the 
linear spin CT3 component and the beam splits into two, "-I-1" and "—1" sub-beams. (2) The "-|-1" beam evolves freely; 
in the path of the "—1" beam a macroscopic observer (or a random generator) takes one of two actions: (a) does 
nothing (say, with probability Ai), or (b) rotates the spin into the "1" direction with a magnetic field coupled to (T2 
(with probability A2). (3) The two beams are rejoined and the ion enters a region of field coupled to the nonlinear 
observable 

Ir p 

(4) The observer again measures the spin with a Stern-Gerlach device coupled to (T3. 
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The steps 1 and 2 prepare the initial condition for the nonhnear evolution, and the reduced density matrix of the 
ion after step 2 is 

The ion's nonlinear Hamiltonian for the step 3 is 

H = 2/^., (127) 

where we have assumed the "correct" form of the evolution, i.e. that it is generated by the Hamiltonian function 
depending only on the reduced density matrix of the ion. The reduced density matrix satisfies 

ip = 2/^^[ai,p] (128) 
ir p 



whose solution is 



p{t) = e-2'/('^i><^itp(o)e2'/<<^i><^i* 



= yl + ^(21 + CTi+CT3Cos2A2/t + CT2sin2A2/i). (129) 

In the analogous manner we can calculate the evolution of the projector P± = ^(1 ± 0-3). We find in the "Heisenberg 
picture" (i.e. we solve the Heisenberg equations of motion with the nonlinear Hamiltonian) 

P± (t) = ^ (1 ± C73 cos 2A2/i T CT3 sin 2A2/i) . (130) 

The linear case, where in step three the ion couples with the same coupling constant linearly to cti, would yield 

P±{t) = ^(l±c73COs2/fTCT3sin2/t). (131) 

Assuming that the time of the interaction during the third step satisfies 2ft = tt we get, in the linear case, 

P±(t) = ^(lT<73) =Pt(0), (132) 

which means that the spin changes its sign during the evolution. In the nonlinear case, however, 

P± (t) = i ^1 ± (T3 cos A27r =F 0-3 sin A27r^ (133) 

hence, in particular, the evolution of P+ depends on A2 — the probability of one of the two actions taken by the 
observer or the random generator in case the spin turned out to be —1. The result means that the evolution of the 
ion depends on "intentions" of the observer concerning his possible actions he would have undertaken had the spin 
of the ion turned out to be —1 — even in case the spin is +1 and the observer is passive! For example, for A2 = 0, 
that is when the observer is decided not to take any actions, the spin state of the ion would remain unchanged. In 
the opposite case, A2 = 1, the evolution would be like in linear QM. 

It seems that the essential point of this argumentation is the assumption that a single member of a beam of ions is 
described by the same density matrix as the whole ensemble. This is exactly opposite to the reasoning leading to Gisin's 
telegraph. And I think this example indicates one of the most fundamental conceptual, or practical, difficulties of 
nonlinear QM: The fact that we do not really know how to treat beams of siglc objects. Intuitively, weak beams should 
be "linear" while strong ones could be, perhaps, "nonlinear" in some mean-field sense. The second hint for futher 
generalizations of linear QM is a possibility, suggested by Wigner, that the only domain of fundamental nonlinearities 
could be the consciousness of observer. Then the density matrix representing the consciousness could evolve nonlinearly 
and no decompositions of the observer into sub-cnsombles would make any sense. Still, the Polchinski's phenomenon 
can describe something like intuition: A perception of a single event depends on the overall property of an ensemble 
of such events since the whole density matrix is involved. 

A question that arises immediately is how can we distinguish between systems that are "conscious" (observers) 
and "non-conscious" . If one wants to make such distinctions more formal and without introducing the state vector 
reduction postulate, one has to use the language of the information theory. In the second part of this paper I shall 
discuss a generalization of QM based on Renyi entropies. Such entropies naturally select a class of systems that can 
gain information. It will be shown that the possibility of gaining information by a quantum system is naturally related 
to the nonlinearity of evolution. 
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